Results (page 10): Internet crawling 




Found 15,998 of 176,279 

Results 181 to 200 of 200 (best 200 shown)

181 Finding replicated Web collections 

Junghoo Cho, Narayanan Shivakumar, Hector Garcia-Molina 

May 2000 ACM SIGMOD Record, Proceedings of the 2000 ACM SIGMOD international conference on Management of data SIGMOD '00, volume 29 issue 2
Publisher: ACM Press 

Full text available: pdf (332.72 KB) Additional Information: full citation, abstract, references, citings, index terms

Many web documents (such as JAVA FAQs) are being replicated on the Internet. Often 
entire document collections (such as hyperlinked Linux manuals) are being replicated 
many times. In this paper, we make the case for identifying replicated documents and 
collections to improve web crawlers, archivers, and ranking functions used in search 
engines. The paper describes how to efficiently identify replicated documents and 
hyperlinked document collections. The challenge is to identify these replicas ... 

We have designed and implemented new Web browsing facilities to support effective
navigation on Personal Digital Assistants (PDAs) with limited capabilities: low bandwidth, 
small display, and slow CPU. The implementation supports wireless browsing from 3Com's 
Palm Pilot. An HTTP proxy fetches web pages on the client's behalf and dynamically 
generates summary views to be transmitted to the client. These summaries represent 
both the link structure and contents of a set of web pages, using infor ... 

Publisher: ACM Press

Full text available: pdf (564.90 KB)

The Web is a rich source of information, but this information is scattered and hidden in
the diversity of web pages. Search engines are windows to the web. However, the current
search engines, designed to identify pages with specified phrases, have very limited
power. For example, they cannot search for phrases related in a particular way (e.g.
books and their authors). In this paper we present a solution for identifying a set of
inter-related information on the web using the

185 A multiagent system for content based navigation of music 
David De Roure, Samhaa El-Beltagy, Steven Blackburn, Wendy Hall

October 1999 Proceedings of the seventh ACM international conference on

D. Krishnamurthy, J. Rolia

November 1998 Proceedings of the 1998 conference of the Centre for Advanced 
Studies on Collaborative research 

Publisher: IBM Press 

Full text available: pdf (113.14 KB) Additional Information: full citation, abstract, references, index terms

The cycle time of an Internet based online shopper includes time at an electronic 
commerce (e-commerce) server to gather information and purchase products, download 
time to transfer data over the Internet, and think time for interpreting the results of 
individual requests. Currently most home based shoppers are limited to 56.6K modems
and have cycle times largely determined by download time. Mega-bit (Mb) modems will 
soon be commonplace and will cause a significant reduction in the download time ... 

Need to transfer files between your desktop and your laptop? Here's the easy way to do
it: networking

199 World Wide Web: a hypercharged view of the internet presenting a mosaic of information choices
Tim Fitzgerald 

June 1995 ACM SIGUCCS Newsletter, volume 25 issue 1-2 
Publisher: ACM Press 

Full text available: pdf (431.99 KB) Additional Information: full citation, abstract, index terms

With the term information superhighway being bandied about as the latest political 
buzzword and the newest computing cliche, people who search for information 
electronically have to wonder if the global network of networks known as the Internet is 
the realization of this newest technological legend or just a precursor better relegated to 
terminology like the information bike path. First-time users of Internet services like
e-mail, Archie, Anonymous FTP and Gopher may marvel at the unprece ...

200 Is Information Systems Spending Productive? 
A. Lee Ridgway 

Publisher: ACM Press

When your department is named Information Systems and you see an article entitled "Is
Information Systems Spending Productive?" you tend to sit up and take notice. Such an 